Biostatistics Series Module 7: The Statistics of Diagnostic Tests
Abstract
Crucial therapeutic decisions are based on diagnostic tests. Therefore, it is important to evaluate such tests before adopting them for routine use. Although blood tests, cultures, biopsies, and radiological imaging are obvious diagnostic tests, it should not be forgotten that specific clinical examination procedures, scoring systems based on physiological or psychological evaluation, and ratings based on questionnaires are also diagnostic tests and therefore merit similar evaluation. In the simplest scenario, a diagnostic test will give either a positive (disease likely) or negative (disease unlikely) result. Ideally, all those with the disease should be classified by a test as positive and all those without the disease as negative. Unfortunately, practically no test gives 100% accurate results. Therefore, leaving aside the economic question, the performance of diagnostic tests is evaluated on the basis of certain indices such as sensitivity, specificity, positive predictive value, and negative predictive value. Likelihood ratios combine information on specificity and sensitivity to express the likelihood that a given test result would occur in a subject with a disorder compared to the probability that the same result would occur in a subject without the disorder. Not all tests can be categorized simply as "positive" or "negative." Physicians are frequently exposed to test results on a numerical scale, and in such cases, judgment is required in choosing a cutoff point to distinguish normal from abnormal. Naturally, a cutoff value should provide the greatest predictive accuracy, but there is a trade-off between sensitivity and specificity here - if the cutoff is too low, it will identify most patients who have the disease (high sensitivity) but will also incorrectly identify many who do not (low specificity).
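The indices named above can be illustrated with a minimal sketch, assuming the standard 2x2 layout of test result against true disease status. The class name `TwoByTwo`, its method names, and the counts are hypothetical and chosen only for illustration; they are not taken from the article.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TwoByTwo:
    """Counts from cross-tabulating a test against a reference standard."""
    tp: int  # diseased, test positive
    fp: int  # not diseased, test positive
    fn: int  # diseased, test negative
    tn: int  # not diseased, test negative

    def sensitivity(self) -> float:
        # Proportion of diseased subjects the test calls positive.
        return self.tp / (self.tp + self.fn)

    def specificity(self) -> float:
        # Proportion of non-diseased subjects the test calls negative.
        return self.tn / (self.tn + self.fp)

    def ppv(self) -> float:
        # Proportion of positive results that are truly diseased.
        return self.tp / (self.tp + self.fp)

    def npv(self) -> float:
        # Proportion of negative results that are truly disease-free.
        return self.tn / (self.tn + self.fn)

    def lr_positive(self) -> float:
        # How much more likely a positive result is in diseased subjects.
        return self.sensitivity() / (1 - self.specificity())

    def lr_negative(self) -> float:
        # How much more likely a negative result is in diseased subjects.
        return (1 - self.sensitivity()) / self.specificity()


# Illustrative counts only (100 diseased, 200 non-diseased subjects).
table = TwoByTwo(tp=90, fp=30, fn=10, tn=170)
print(f"sensitivity={table.sensitivity():.2f}, specificity={table.specificity():.2f}")
print(f"PPV={table.ppv():.2f}, NPV={table.npv():.2f}")
print(f"LR+={table.lr_positive():.2f}, LR-={table.lr_negative():.2f}")
```

Note that the predictive values computed this way depend on the proportion of diseased subjects in the sample, whereas sensitivity, specificity, and the likelihood ratios do not.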
A receiver operating characteristic curve plots pairs of sensitivity versus (1 - specificity) values and helps in selecting an optimum cutoff - the one lying on the "elbow" of the curve. Cohen's kappa (κ) statistic is a measure of inter-rater agreement for categorical variables. It can also be applied to assess how far two tests agree with respect to diagnostic categorization. It is generally thought to be a more robust measure than simple percent agreement calculation since kappa takes into account the agreement occurring by chance.
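A rough sketch of both ideas follows. It assumes that higher scores indicate disease (a result is positive when the score is at or above the cutoff) and, as one common convention rather than the article's stated rule, takes the "elbow" to be the ROC point closest to the top-left corner. The function names and all data are hypothetical.

```python
def roc_points(scores, labels, cutoffs):
    """Return (cutoff, sensitivity, 1 - specificity) for each candidate cutoff.

    labels: 1 = diseased, 0 = not diseased.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    points = []
    for c in cutoffs:
        sens = sum(s >= c for s in pos) / len(pos)
        false_pos_rate = sum(s >= c for s in neg) / len(neg)
        points.append((c, sens, false_pos_rate))
    return points


def elbow_cutoff(points):
    # Assumption: "elbow" = point nearest to (0, 1), i.e. perfect classification.
    return min(points, key=lambda p: p[2] ** 2 + (1 - p[1]) ** 2)[0]


def cohen_kappa(a, b, c, d):
    """Kappa for a 2x2 agreement table between two tests (or raters).

    a = both positive, b = first positive / second negative,
    c = first negative / second positive, d = both negative.
    """
    n = a + b + c + d
    observed = (a + d) / n
    # Agreement expected by chance, from the marginal totals.
    expected = ((a + b) * (a + c) + (c + d) * (b + d)) / n ** 2
    return (observed - expected) / (1 - expected)


# Illustrative data only.
scores = [7, 8, 9, 6, 5, 2, 3, 4, 5, 6]
labels = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
points = roc_points(scores, labels, cutoffs=[4, 5, 6, 7])
best = elbow_cutoff(points)
kappa = cohen_kappa(a=40, b=10, c=5, d=45)
print("chosen cutoff:", best)
print(f"kappa={kappa:.2f}")
```

In the agreement example the two tests agree on 85 of 100 subjects, yet kappa is lower than 0.85 because half of that agreement would be expected by chance given the marginal totals.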